SEQUENCE LISTING 

<110> 

<120> METHODS AND COMPOSITIONS FOR SIMULTANEOUS 
      SACCHARIFICATION AND FERMENTATION 

<130> BCI-024CP 

<140> 09/885,297 
<141> 2001-09-19 

<150> 60/214,137 
<151> 2000-06-26 

<150> 60/219,913 
<151> 2000-07-21 

<160> 17 

<170> PatentIn Ver. 2.0 

<210> 1 
<211> 450 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : primer 
<400> 1 

ctttttcggc atgagcaacc aacattttca aggtatcatc ctgatgcgca atatcggcat 60 
cggttagcca taaccatttt acctgtccgg cggccttaat accttgatca gatggttcgt 120 
ggtgttgtta ccttgccgaa gggcaccggt aaaaatgttc gcgtcggtgt tttcgcccgt 180 
ggcccgaaag ctgaagaagc taaagctgct ggtgcagaag ttgtcggcgc agaagacctg 240 
atggaagcca ttcagggcgg cagcattgat ttcgatcgtg atgcccttta tactgaaatt 300 
gccttgcgct gccataatga agcagcctcc ggtgttttgg cagatttaag cgctgcctga 360 
ttttcgtgat cctctagagt ctatgaaatg gagattcatt tatgcctctc tcttattcgg 420 
ataaccatcc agtcatccgc aagcttggcc 450 



<210> 2 
<211> 1509 
<212> DNA 

<213> Artificial Sequence 
<220> 
<223> Description of Artificial Sequence : primer 
<400> 2 

gatcaaccgg caatttatcc acggcatcaa attcgatctg tcttttcccg tatcattggc 60 
aataccggca ttctgattac aggccgtgtt ttgaatgcgg tatgcagttt tgtctatgtc 120 
ctgtttggtg tcatgctttt gattacgact 180 
tttcagtcat ggcaaacctt gctgcattac 240 
aaccaatttg atgatgtcct tgccttttgc 300 
ggtatgttgg tagggttagg cggtatcttg 360 
gccgaggtca agccagatgc cttgctttgt 420 
tggtccaacc gggatgttgc ggctgtgtaa 480 
tattacgacc tgcgtcagaa ccggaggttg 540 
ggggtatttt ttgtttatat ggtgcctgac 600 
cgctggcatt tatctctttc accaatatac 660 
tcagagaaaa acgcccgttg aagggatgtg 720 
aatcctagac tccattttcc agcagggtgg 780 
tggggaagcc gctgtctatc gggtcgcgcg 840 
acagatgatg atcggctaac atgcatccac 900 
tattgatatg ccgaaaagga tacaacatct 960 
cccattctgc aaaattgtta caacttagtc 1020 
ctttttcggc atgagcaacc aacattttca 1080 
cggttagcca taaccatttt acctgtccgg 1140 
ggtgttgtta ccttgccgaa gggcaccggt 1200 
ggcccgaaag ctgaagaagc taaagctgct 1260 
atggaagcca ttcagggcgg cagcattgat 1320 
gccttgcgct gccataatga agcagcctcc 1380 
ttttcgtgat cctctagagt ctatgaaatg 1440 
ataaccatcc agtcatccgc aagcttggcc 1500 
gtaatccat 1509 



<210> 3 
<211> 27 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : primer 
<400> 3 

cgaattcctg ccgaagttta ttagcca 27 

<210> 4 

<211> 31 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : primer 

<400> 4 

aaggatcctt ccaccagcta tttgttagtg a 

<210> 5 
<211> 27 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : primer 
<400> 5 

agaattctgc cagttggttg acgatag 

<210> 6 
<211> 30 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : primer 
<400> 6 

caggatcccc tcaagtcact agttaaactg 

<210> 7 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : primer 
<400> 7 

taatacgact cactataggg 

<210> 8 
<211> 18 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : primer 
<400> 8 

taacaatttc acacagga 
<210> 9 
<211> 19 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : primer 
<400> 9 

cacgacgttg taaaacgac 



<210> 10 
<211> 30 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : primer 
<400> 10 

gactggatgg ttatccgaat aagagagagg 30 



<210> 11 
<211> 8 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : primer 

<400> 11 
ggcgcgcc 

<210> 12 
<211> 11544 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : vector 
<220> 

<221> misc_feature 
<222> 3282-4281 
<223> n=a,c,g or t 

<220> 

<221> misc_feature 
<222> 9476-11544 
<223> n=a,c,g or t 

<220> 
<223> nucleotide positions 1-1451 encode promoter 

<220> 
<223> nucleotide positions 1452-2735 encode celZ gene 

<220> 
<223> nucleotide positions 4916-5776 encode bla gene 

<220> 
<223> nucleotide positions 7061-8251 encode tet gene 

<220> 
<223> nucleotide positions 9476-11544 encode target sequence from K. oxytoca 

<220> 
<221> CDS 
<222> (1452)..(2735) 

<220> 
<221> CDS 
<222> (4916)..(5776) 

<220> 
<221> CDS 
<222> (7061)..(8251) 



<400> 12 
gatcaaccgg caatttatcc acggcatcaa attcgatctg tcttttcccg tatcattggc 60 
aataccggca ttctgattac aggccgtgtt ttgaatgcgg tatgcagttt tgtctatgtc 120 
gcatggacat cccagacatt gggattgaac ctgtttggtg tcatgctttt gattacgact 180 
tttgctaccc tgatttcgga tattacccgt tttcagtcat ggcaaacctt gctgcattac 240 
ggttcaaaag cttttcagga aaaagatttt aaccaatttg atgatgtcct tgccttttgc 300 
atcagagccg atttttttag tgcggcgata ggtatgttgg tagggttagg cggtatcttg 360 
attttaggca cttcaagatt gggatggcct gccgaggtca agccagatgc cttgctttgt 420 
atgctgatta tactttttat gaatatcggc tggtccaacc gggatgttgc ggctgtgtaa 480 
ccgctttaaa ctggtcacta tttatgagtt tattacgacc tgcgtcagaa ccggaggttg 540 
tggcattggt tattggcttc atatgccttt ggggtatttt ttgtttatat ggtgcctgac 600 
gcaattcacg ctttttgtca cctgtagtta cgctggcatt tatctctttc accaatatac 660 
ggagcgagca tttccgataa gaaaaatatt tcagagaaaa acgcccgttg aagggatgtg 720 
gaaattcact ttaagcgtca gttttaatga aatcctagac tccattttcc agcagggtgg 780 
cacccttgct attggtagct cactgggggc tggggaagcc gctgtctatc gggtcgcgcg 840 
ccagattagt aacggtttat ccaaaccagc acagatgatg atcggctaac atgcatccac 900 
cggcagcacc ggccgtttta tgcttgggat tattgatatg ccgaaaagga tacaacatct 960 
ggaagaaaaa gacgaaggcc ggaataagcg cccattctgc aaaattgtta caacttagtc 1020 
gcgccatcag ggaatgaaaa atcaatccgt ctttttcggc atgagcaacc aacattttca 1080 
aggtatcatc ctgatgcgca atatcggcat cggttagcca taaccatttt acctgtccgg 1140 
cggccttaat accttgatca gatggttcgt ggtgttgtta ccttgccgaa gggcaccggt 1200 
aaaaatgttc gcgtcggtgt tttcgcccgt ggcccgaaag ctgaagaagc taaagctgct 1260 
ggtgcagaag ttgtcggcgc agaagacctg atggaagcca ttcagggcgg cagcattgat 1320 
ttcgatcgtg atgcccttta tactgaaatt gccttgcgct gccataatga agcagcctcc 1380 
ggtgttttgg cagatttaag cgctgcctga ttttcgtgat cctctagagt ctatgaaatg 1440 

gagattcatt t atg cct ctc tct tat tcg gat aac cat cca gtc atc gat 1490 
Met Pro Leu Ser Tyr Ser Asp Asn His Pro Val Ile Asp 
1 5 10 

agc caa aaa cac gcc cca cgt aaa aaa ctg ttt cta tct tgt gcc tgt 1538 
Ser Gln Lys His Ala Pro Arg Lys Lys Leu Phe Leu Ser Cys Ala Cys 
15 20 25 

tta gga tta agc ctt gcc tgc ctt tcc agt aat gcc tgg gcg agt gtt 1586 
Leu Gly Leu Ser Leu Ala Cys Leu Ser Ser Asn Ala Trp Ala Ser Val 
30 35 40 45 

gag ccg tta tcc gtt agc ggc aat aaa atc tac gca ggt gaa aaa gcc 1634 
Glu Pro Leu Ser Val Ser Gly Asn Lys Ile Tyr Ala Gly Glu Lys Ala 
50 55 60 

aaa agt ttt gcc ggc aac agc tta ttc tgg agt aat aat ggt tgg ggt 1682 
Lys Ser Phe Ala Gly Asn Ser Leu Phe Trp Ser Asn Asn Gly Trp Gly 
65 70 75 

ggg gaa aaa ttc tac aca gcc gat acc gtt gcg tcg ctg aaa aaa gac 1730 
Gly Glu Lys Phe Tyr Thr Ala Asp Thr Val Ala Ser Leu Lys Lys Asp 
80 85 90 

tgg aaa tcc agc att gtt cgc gcc gct atg ggc gtt cag gaa agc ggt 1778 
Trp Lys Ser Ser Ile Val Arg Ala Ala Met Gly Val Gln Glu Ser Gly 
95 100 105 

ggt tat ctg cag gac ccg gct ggc aac aag gcc aaa gtt gaa aga gtg 1826 
Gly Tyr Leu Gln Asp Pro Ala Gly Asn Lys Ala Lys Val Glu Arg Val 
110 115 120 125 

gtg gat gcc gca atc gcc aac gat atg tat gtg att att gac tgg cac 1874 
Val Asp Ala Ala Ile Ala Asn Asp Met Tyr Val Ile Ile Asp Trp His 
130 135 140 

tca cat tct gca gaa aac aat cgc agt gaa gcc att cgc ttc ttc cag 1922 
Ser His Ser Ala Glu Asn Asn Arg Ser Glu Ala Ile Arg Phe Phe Gln 
145 150 155 

gaa atg gcg cgc aaa tat ggc aac aag ccg aat gtc att tat gaa atc 1970 
Glu Met Ala Arg Lys Tyr Gly Asn Lys Pro Asn Val Ile Tyr Glu Ile 
160 165 170 

tac aac gag ccg ctt cag gtt tca tgg agc aat acc att aaa cct tat 2018 
Tyr Asn Glu Pro Leu Gln Val Ser Trp Ser Asn Thr Ile Lys Pro Tyr 
175 180 185 

gcc gaa gcc gtg att tcc gcc att cgc gcc att gac ccg gat aac ctg 2066 
Ala Glu Ala Val Ile Ser Ala Ile Arg Ala Ile Asp Pro Asp Asn Leu 
190 195 200 205 

att att gtc ggt acg ccc agt tgg tcg caa aac gtt gat gaa gcg tcg 2114 
Ile Ile Val Gly Thr Pro Ser Trp Ser Gln Asn Val Asp Glu Ala Ser 
210 215 220 

cgc gat cca atc aac gcc aag aat atc gcc tat acg ctg cat ttc tac 2162 
Arg Asp Pro Ile Asn Ala Lys Asn Ile Ala Tyr Thr Leu His Phe Tyr 
225 230 235 

gcg gga acc cat ggt gag tca tta cgc act aaa gcc cgc cag gcg tta 2210 
Ala Gly Thr His Gly Glu Ser Leu Arg Thr Lys Ala Arg Gln Ala Leu 
240 245 250 

aat aac ggt att gcg ctt ttc gtc acc gag tgg ggc gcc gtt aac gcg 2258 
Asn Asn Gly Ile Ala Leu Phe Val Thr Glu Trp Gly Ala Val Asn Ala 
255 260 265 

gac ggc aat ggc gga gtg aac cag aca gat acc gac gcc tgg gta acg 2306 
Asp Gly Asn Gly Gly Val Asn Gln Thr Asp Thr Asp Ala Trp Val Thr 
270 275 280 285 

ttc atg cgt gac aac aac atc agc aac gca aac tgg gcg tta aat gat 2354 
Phe Met Arg Asp Asn Asn Ile Ser Asn Ala Asn Trp Ala Leu Asn Asp 
290 295 300 

aaa agc gaa ggg gca tca acc tat tat ccg gac tct aaa aac ctg acc 2402 
Lys Ser Glu Gly Ala Ser Thr Tyr Tyr Pro Asp Ser Lys Asn Leu Thr 
305 310 315 

gag tcg ggt aaa ata gta aaa tcg atc att caa agc tgg cca tat aaa 2450 
Glu Ser Gly Lys Ile Val Lys Ser Ile Ile Gln Ser Trp Pro Tyr Lys 
320 325 330 

gcg ggc agc gcc gcc agt aca aca acc gat cag tca acc gat acc acc 2498 
Ala Gly Ser Ala Ala Ser Thr Thr Thr Asp Gln Ser Thr Asp Thr Thr 
335 340 345 

atg gca cca ccg ttg acg aac cga cca caa ccg aca cac cgg caa acc 2546 
Met Ala Pro Pro Leu Thr Asn Arg Pro Gln Pro Thr His Arg Gln Thr 
350 355 360 365 

gct gat tgc tgc aat gcc aac gtt tac ccc aac tgg gtt agc aaa gac 2594 
Ala Asp Cys Cys Asn Ala Asn Val Tyr Pro Asn Trp Val Ser Lys Asp 
370 375 380 

tgg gcg ggc cgg cag cga ctc ata acg aag cag gcc aat cga tcg tct 2642 
Trp Ala Gly Arg Gln Arg Leu Ile Thr Lys Gln Ala Asn Arg Ser Ser 
385 390 395 

aca aag gga acc tgt ata ccg caa act ggt aca ctt cat ccg ttc cgg 2690 
Thr Lys Gly Thr Cys Ile Pro Gln Thr Gly Thr Leu His Pro Phe Arg 
400 405 410 

gca gcg att cct cct ggg cac agg ttg gta gct gta act aat tga 2735 
Ala Ala Ile Pro Pro Gly His Arg Leu Val Ala Val Thr Asn 
415 420 425 

ttaatctttt cacccccaaa ataacagggc tgcgattgca gcctgatacg caacattcca 2795 
ttacttaatt gcgttcaaaa gcgcccaaat ccggtgcgct gccttgtaac taatatgatt 2855 
tctctttcgt acccgcgtta atcagctttg agttagccga cagacggaac agcgaggttg 2915 
ccggcaacgt gccgtcatta tcacgagata cggtagccag cgaggtgtcc aggctgacga 2975 
atcggacgcg gaagccgctg tccgtatcca tgagttgact cgcatccgca ttactgaccg 3035 
ttgcagaagc agacagagac acgttgttgc ggaagtaatg tttctgtcct gactggacgt 3095 
tgctcccgaa agcataatta atgccgtttt tatatgacgt gttatttatt accgtacgcc 3155 
gccgcgttat tgttctggtc aaaacctttg ctcacgttgc caaacgcgac gcaacgggta 3215 
atgcgatgat tgccgaccgc tggttcctcc cagtttgaac ccgttggcat tgccggcgaa 3275 
cgcgctnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 3335 
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 3395 
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 3455 
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 3515 
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 3575 
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 3635 
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 3695 
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 3755 
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 3815 
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 3875 
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 3935 
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 3995 
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 4055 
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 4115 
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 4175 
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 4235 
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnngatc ctctagagtc 4295 
gacctgcagg aattcactgg ccgtcgtttt acaacgtcgt gactgggaaa accctggcgt 4355 
tacccaactt aatcgccttg cagcacatcc ccctttcgcc agctggcgta atagcgaaga 4415 
ggcccgcacc gatcgccctt cccaacagtt gcgcagcctg aatggcgaat ggcgcctgat 4475 
gcggtatttt ctccttacgc atctgtgcgg tatttcacac cgcataggcg cgcctatggt 4535 
gcactctcag tacaatctgc tctgatgccg catagttaag ccagccccga cacccgccaa 4595 
cacccgctga cgcgccctga cgggcttgtc tgctcccggc atccgcttac agacaagctg 4655 
tgaccgtctc cgggagctgc atgtgtcaga ggttttcacc gtcatcaccg aaacgcgcga 4715 
gacgaaaggg cctcgtgata cgcctatttt tataggttaa tgtcatgata ataatggttt 4775 
cttagacgtc aggtggcact tttcggggaa atgtgcgcgg aacccctatt tgtttatttt 4835 
tctaaataca ttcaaatatg tatccgctca tgagacaata accctgataa atgcttcaat 4895 

aatattgaaa aaggaagagt atg agt att caa cat ttc cgt gtc gcc ctt att 4948 
Met Ser Ile Gln His Phe Arg Val Ala Leu Ile 
1 5 10 

ccc ttt ttt gcg gca ttt tgc ctt cct gtt ttt gct cac cca gaa acg 4996 
Pro Phe Phe Ala Ala Phe Cys Leu Pro Val Phe Ala His Pro Glu Thr 
15 20 25 

ctg gtg aaa gta aaa gat gct gaa gat cag ttg ggt gca cga gtg ggt 5044 
Leu Val Lys Val Lys Asp Ala Glu Asp Gln Leu Gly Ala Arg Val Gly 
30 35 40 

tac atc gaa ctg gat ctc aac agc ggt aag atc ctt gag agt ttt cgc 5092 
Tyr Ile Glu Leu Asp Leu Asn Ser Gly Lys Ile Leu Glu Ser Phe Arg 
45 50 55 



ccc gaa gaa cgt ttt cca atg atg agc act ttt aaa gtt ctg cta tgt 5140 
Pro Glu Glu Arg Phe Pro Met Met Ser Thr Phe Lys Val Leu Leu Cys 
60 65 70 75 

ggc gcg gta tta tcc cgt att gac gcc ggg caa gag caa ctc ggt cgc 5188 
Gly Ala Val Leu Ser Arg Ile Asp Ala Gly Gln Glu Gln Leu Gly Arg 
80 85 90 

cgc ata cac tat tct cag aat gac ttg gtt gag tac tca cca gtc aca 5236 
Arg Ile His Tyr Ser Gln Asn Asp Leu Val Glu Tyr Ser Pro Val Thr 
95 100 105 

gaa aag cat ctt acg gat ggc atg aca gta aga gaa tta tgc agt gct 5284 
Glu Lys His Leu Thr Asp Gly Met Thr Val Arg Glu Leu Cys Ser Ala 
110 115 120 

gcc ata acc atg agt gat aac act gcg gcc aac tta ctt ctg aca acg 5332 
Ala Ile Thr Met Ser Asp Asn Thr Ala Ala Asn Leu Leu Leu Thr Thr 
125 130 135 

atc gga gga ccg aag gag cta acc gct ttt ttg cac aac atg ggg gat 5380 
Ile Gly Gly Pro Lys Glu Leu Thr Ala Phe Leu His Asn Met Gly Asp 
140 145 150 155 

cat gta act cgc ctt gat cgt tgg gaa ccg gag ctg aat gaa gcc ata 5428 
His Val Thr Arg Leu Asp Arg Trp Glu Pro Glu Leu Asn Glu Ala Ile 
160 165 170 



cca aac gac gag cgt gac acc acg atg cct gta gca atg gca aca acg 5476 
Pro Asn Asp Glu Arg Asp Thr Thr Met Pro Val Ala Met Ala Thr Thr 
175 180 185 

ttg cgc aaa cta tta act ggc gaa cta ctt act cta gct tcc cgg caa 5524 
Leu Arg Lys Leu Leu Thr Gly Glu Leu Leu Thr Leu Ala Ser Arg Gln 
190 195 200 

caa tta ata gac tgg atg gag gcg gat aaa gtt gca gga cca ctt ctg 5572 
Gln Leu Ile Asp Trp Met Glu Ala Asp Lys Val Ala Gly Pro Leu Leu 
205 210 215 
cgc tcg gcc ctt ccg gct ggc tgg ttt att gct gat aaa tct gga gcc 5620 
Arg Ser Ala Leu Pro Ala Gly Trp Phe Ile Ala Asp Lys Ser Gly Ala 
220 225 230 235 

ggt gag cgt ggg tct cgc ggt atc att gca gca ctg ggg cca gat ggt 5668 
Gly Glu Arg Gly Ser Arg Gly Ile Ile Ala Ala Leu Gly Pro Asp Gly 
240 245 250 

aag ccc tcc cgt atc gta gtt atc tac acg acg ggg agt cag gca act 5716 
Lys Pro Ser Arg Ile Val Val Ile Tyr Thr Thr Gly Ser Gln Ala Thr 
255 260 265 

atg gat gaa cga aat aga cag atc gct gag ata ggt gcc tca ctg att 5764 
Met Asp Glu Arg Asn Arg Gln Ile Ala Glu Ile Gly Ala Ser Leu Ile 
270 275 280 

aag cat tgg taa ctgtcagacc aagtttactc atatatactt tagattgatt 5816 
Lys His Trp 
285 



taaaacttca tttttaattt aaaaggatct aggtgaagat cctttttgat aatctcatga 5876 
ccaaaatccc ttaacgtgag ttttcgttcc actgagcgtc agaccccgta gaaaagatca 5936 
aaggatcttc ttgagatcct ttttttctgc gcgtaatctg ctgcttgcaa acaaaaaaac 5996 
caccgctacc agcggtggtt tgtttgccgg atcaagagct accaactctt tttccgaagg 6056 
taactggctt cagcagagcg cagataccaa atactgtcct tctagtgtag ccgtagttag 6116 
gccaccactt caagaactct gtagcaccgc ctacatacct cgctctgcta atcctgttac 6176 
cagtggctgc tgccagtggc gataagtcgt gtcttaccgg gttggactca agacgatagt 6236 
taccggataa ggcgcagcgg tcgggctgaa cggggggttc gtgcacacag cccagcttgg 6296 
agcgaacgac ctacaccgaa ctgagatacc tacagcgtga gctatgagaa agcgccacgc 6356 
ttcccgaagg gagaaaggcg gacaggtatc cggtaagcgg cagggtcgga acaggagagc 6416 
gcacgaggga gcttccaggg ggaaacgcct ggtatcttta tagtcctgtc gggtttcgcc 6476 
acctctgact tgagcgtcga tttttgtgat gctcgtcagg ggggcggagc ctatggaaaa 6536 
acgccagcaa cgcggccttt ttacggttcc tggccttttg ctggcctttt gctcacatgt 6596 
tctttcctgc gttatcccct gattctgtgg ataaccgtat taccgccttt gagtgagctg 6656 
ataccgctcg ccgcagccga acgaccgagc gcagcgagtc agtgagcgag gaagcggcgc 6716 
gccagcggaa gagcgcccaa tacgcaaacc gcctctcccc gcgcgttggc cgattcatta 6776 
atgcagctgg cacgacaggt ttcccgactg gaaagcgggc agtgagcgca acgcaattaa 6836 
tgtgagttag ctcactcatt aggcacccca ggctttacac tttatgcttc cggctcgtat 6896 
gttgtgtgga attgtgagcg gataacaatt tcacacagga aacagctatg accatgatta 6956 
cgccaagctt gcatgccaat tctcatgttt gacagcttat catcgataag ctttaatgcg 7016 

gtagtttatc acagttaaat tgctaacgca gtcaggcacc gtgt atg aaa tct aac 7072 
Met Lys Ser Asn 

aat gcg ctc atc gtc atc ctc ggc acc gtc acc ctg gat gct gta ggc 7120 
Asn Ala Leu Ile Val Ile Leu Gly Thr Val Thr Leu Asp Ala Val Gly 
5 10 15 20 

ata ggc ttg gtt atg ccg gta ctg ccg ggc ctc ttg cgg gat atc gtc 7168 
Ile Gly Leu Val Met Pro Val Leu Pro Gly Leu Leu Arg Asp Ile Val 
25 30 35 

cat tcc gac agc atc gcc agt cac tat ggc gtg ctg cta gcg cta tat 7216 
His Ser Asp Ser Ile Ala Ser His Tyr Gly Val Leu Leu Ala Leu Tyr 
40 45 50 

gcg ttg atg caa ttt cta tgc gca ccc gtt ctc gga gca ctg tcc gac 7264 
Ala Leu Met Gln Phe Leu Cys Ala Pro Val Leu Gly Ala Leu Ser Asp 
55 60 65 

cgc ttt ggc cgc cgc cca gtc ctg ctc gct tcg cta ctt gga gcc act 7312 
Arg Phe Gly Arg Arg Pro Val Leu Leu Ala Ser Leu Leu Gly Ala Thr 
70 75 80 

atc gac tac gcg atc atg gcg acc aca ccc gtc ctg tgg atc ctc tac 7360 
Ile Asp Tyr Ala Ile Met Ala Thr Thr Pro Val Leu Trp Ile Leu Tyr 
85 90 95 100 

gcc gga cgc atc gtg gcc ggc atc acc ggc gcc aca ggt gcg gtt gct 7408 
Ala Gly Arg Ile Val Ala Gly Ile Thr Gly Ala Thr Gly Ala Val Ala 
105 110 115 

ggc gcc tat atc gcc gac atc acc gat ggg gaa gat cgg gct cgc cac 7456 
Gly Ala Tyr Ile Ala Asp Ile Thr Asp Gly Glu Asp Arg Ala Arg His 
120 125 130 

ttc ggg ctc atg agc gct tgt ttc ggc gtg ggt atg gtg gca ggc ccc 7504 
Phe Gly Leu Met Ser Ala Cys Phe Gly Val Gly Met Val Ala Gly Pro 
135 140 145 

gtg gcc ggg gga ctg ttg ggc gcc atc tcc ttg cat gca cca ttc ctt 7552 
Val Ala Gly Gly Leu Leu Gly Ala Ile Ser Leu His Ala Pro Phe Leu 
150 155 160 

gcg gcg gcg gtg ctc aac ggc ctc aac cta cta ctg ggc tgc ttc cta 7600 
Ala Ala Ala Val Leu Asn Gly Leu Asn Leu Leu Leu Gly Cys Phe Leu 
165 170 175 180 

atg cag gag tcg cat aag gga gag cgt cga ccg atg ccc ttg aga gcc 7648 
Met Gln Glu Ser His Lys Gly Glu Arg Arg Pro Met Pro Leu Arg Ala 
185 190 195 

ttc aac cca gtc agc tcc ttc cgg tgg gcg cgg ggc atg act atc gtc 7696 
Phe Asn Pro Val Ser Ser Phe Arg Trp Ala Arg Gly Met Thr Ile Val 
200 205 210 

gcc gca ctt atg act gtc ttc ttt atc atg caa ctc gta gga cag gtg 7744 
Ala Ala Leu Met Thr Val Phe Phe Ile Met Gln Leu Val Gly Gln Val 
215 220 225 

ccg gca gcg ctc tgg gtc att ttc ggc gag gac cgc ttt cgc tgg agc 7792 
Pro Ala Ala Leu Trp Val Ile Phe Gly Glu Asp Arg Phe Arg Trp Ser 
230 235 240 

gcg acg atg atc ggc ctg tcg ctt gcg gta ttc gga atc ttg cac gcc 7840 
Ala Thr Met Ile Gly Leu Ser Leu Ala Val Phe Gly Ile Leu His Ala 
245 250 255 260 

ctc gct caa gcc ttc gtc act ggt ccc gcc acc aaa cgt ttc ggc gag 7888 
Leu Ala Gln Ala Phe Val Thr Gly Pro Ala Thr Lys Arg Phe Gly Glu 
265 270 275 

aag cag gcc att atc gcc ggc atg gcg gcc gac gcg ctg ggc tac gtc 7936 
Lys Gln Ala Ile Ile Ala Gly Met Ala Ala Asp Ala Leu Gly Tyr Val 
280 285 290 

ttg ctg gcg ttc gcg acg cga ggc tgg atg gcc ttc ccc att atg att 7984 
Leu Leu Ala Phe Ala Thr Arg Gly Trp Met Ala Phe Pro Ile Met Ile 
295 300 305 

ctt ctc gct tcc ggc ggc atc ggg atg ccc gcg ttg cag gcc atg ctg 8032 
Leu Leu Ala Ser Gly Gly Ile Gly Met Pro Ala Leu Gln Ala Met Leu 
310 315 320 

tcc agg cag gta gat gac gac cat cag gga cag ctt caa gga tcg ctc 8080 
Ser Arg Gln Val Asp Asp Asp His Gln Gly Gln Leu Gln Gly Ser Leu 
325 330 335 340 

gcg gct ctt acc agc cta act tcg atc act gga ccg ctg atc gtc acg 8128 
Ala Ala Leu Thr Ser Leu Thr Ser Ile Thr Gly Pro Leu Ile Val Thr 
345 350 355 

gcg att tat gcc gcc tcg gcg agc aca tgg aac ggg ttg gca tgg att 8176 
Ala Ile Tyr Ala Ala Ser Ala Ser Thr Trp Asn Gly Leu Ala Trp Ile 
360 365 370 

gta ggc gcc gcc cta tac ctt gtc tgc ctc ccc gcg ttg cgt cgc ggt 8224 
Val Gly Ala Ala Leu Tyr Leu Val Cys Leu Pro Ala Leu Arg Arg Gly 
375 380 385 

gca tgg agc cgg gcc acc tcg acc tga atggaagccg gcggcacctc 8271 
Ala Trp Ser Arg Ala Thr Ser Thr 
390 395 










gctaacggat tcaccactcc aagaattgga gccaatcaat tcttgcggag aactgtgaat 8331 
gcgcaaacca acccttggca gaacatatcc atcgcgtccg ccatctccag cagccgcacg 8391 
cggcgcatct cggggtcgac tctagaggat ccccgcaacg ctgtcagcgc tttccagtta 8451 
aacggctcca acgtcgccat aggtaattcc tcgcccggcc atacgatcgg gcaggtgccg 8511 
ttggctatcg ccgtcgcctg actcatcaca ctatcttccg ctgcatcgcg aagggttttg 8571 
accacttctt ccatctctcc gtgcgccgga tgccatgctc acgtacgcgg cttatcagat 8631 
agtcgggcag gccgtcgttc cagcccaatg aggggaagct ggcgtggagc gatgccagca 8691 
cctgctcctc aacaccgtaa tggccggcgg cgaacaggca ttcggcggta agcgcttcca 8751 
gccctttaat catcacgctg cggcacatct tgatagccga cacgctgcca acgtggttac 8811 
caccatagcg ggcgttacat ccaagcgtgg tgagtaattc agcaattgcc tctgcctgtg 8871 




gtccccccgt caacagcggc gttcggagtg cccctggggg gaccggcgcc atcaccgcta 8931 
catcgacata agcgccgggc ttaaagcatt tggcagcctg acgcttggtc tgcggggcga 8991 
cggagttaag gtcaagaaaa tactgcgtgt cggtcatcag cggtgcagct tgtgaggcga 9051 
catccagggc ggatcccgcg gtgacggtgg aaaatatgag ttcggcacct gtcaacgcgt 9111 
cagccaggga gattgccgcc cgcacgcctc cccgatgcgc cttcgttatc atcgcatcgc 9171 
gctcaggacc ttgcagcttg caatcccaga cggtgactgg gttcactttt gccagtgcat 9231 
ccgcaagaat gccacctgct tcaccataac ctataaacgt tattgtcgtc ataacagctc 9291 
cttacgcggc cacacgtcgg ccggaatgca aacgtcgccc gcgaacagaa gtcgcgccgt 9351 
acgcagcaga ccgcagcctg ccaactgccc attatcatca agccggagcg ccacgctgaa 9411 
ttgggtaccg agctccgaat tgggtaccga gctcgaatta attcgagctc ggtacccggg 9471 
gatcnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 9531 
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 9591 
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 9651 
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 9711 
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 9771 
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 9831 
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 9891 
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 9951 
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 10011 
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 10071 
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 10131 
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 10191 
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 10251 
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 10311 
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 10371 
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 10431 
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 10491 
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 10551 
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 10611 
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 10671 
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 10731 
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 10791 
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 10851 
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 10911 
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 10971 
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 11031 
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 11091 
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 11151 
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 11211 
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 11271 
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 11331 
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 11391 
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 11451 
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 11511 
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnn 11544 



<210> 13 
<211> 18 
<212> DNA 

<213> Artificial Sequence 



<220> 

<223> Description of Artificial Sequence: Primer 
<400> 13 

ctgttccgtt accaacac 

<210> 14 
<211> 18 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 14 

gtgaatggga tcacgagt 



<210> 15 
<211> 26 
<212> DNA 

<213> Artificial Sequence 



<220> 

<223> Description of Artificial Sequence: Primer 
<400> 15 

accatcagca tcaacgccca acaacg 

<210> 16 
<211> 30 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 16 

gactggatgg ttatccgaat aagagagagg 

<210> 17 
<211> 11772 
<212> DNA 

<213> Artificial Sequence 
<220> 

<221> misc_feature 
<222> 144-1143 
<223> n=a,c,g or t 

<220> 
<221> misc_feature 
<222> 5753-7567 
<223> n=a,c,g or t 

<220> 
<223> Description of Artificial Sequence: Synthetic 
construct 

<220> 
<223> unsequenced Erwinia DNA fragment from nucleotide 
position 144 to 1143 

<220> 
<223> P1 promoter region for celZ from nucleotide 
position 2974 to 4424 

<220> 
<223> guide fragment for integration from nucleotide 
position 4677 to 7573 

<220> 
<223> sequenced partial guide fragment from nucleotide 
position 4677 to 5752 

<220> 
<223> unsequenced partial guide fragment from nucleotide 
position 5753 to 7573 

<220> 
<223> P2 promoter region for celY from nucleotide 
position 7585 to 8576 

<220> 
<223> R6K-Y ori from nucleotide position 10388 to 10763 

<220> 
<223> FRT flipase-binding sequence from nucleotide 
position 16 to 50 

<220> 
<223> FRT flipase-binding sequence from nucleotide 
position 10058 to 10092 

<220> 
<223> celZ gene product is encoded by the complement of 
nucleotides 1690 to 2973 

<220> 
<223> celY gene product is encoded by the nucleotides 
8576 to 9574 

<220> 
<223> kanamycin-resistance gene product is encoded by 
the complement of nucleotides 10827 to 11621 



<400> 17 

atcgatgaat 

caagcttgat 

tgcaggtcga 

nnnnnnnnnn 

nnnnnnnnnn 

nnnnnnnnnn 

nnnnnnnnnn 

nnnnnnnnnn 

nnnnnnnnnn 

nnnnnnnnnn 

nnnnnnnnnn 

nnnnnnnnnn 

nnnnnnnnnn 

nnnnnnnnnn 

nnnnnnnnnn 

nnnnnnnnnn 

nnnnnnnnnn 

nnnnnnnnnn 

nnnnnnnnnn 

nnnagcgcgt 

tcatcgcatt 

taacgcggcg 

tcgggagcaa 

cttctgcaac 

gcgtccgatt 

cgttgccggc 

cgaaagagaa 

attaagtaat 

aaagattaat 

cggatgaagt 

cttcgttatg 

ggcattgcag 

tgccatggtg 

tggccagctt 

cggataatag 

gttgttgtca 

gccgtccgcg 

ctggcgggct 

ggcgatattc 

gggcgtaccg 

tt cggcataa 



tgatccgaag 

ctggcttatc 

ctctagagga 

nnnnnnnnnn 

nnnnnnnnnn 

nnnnnnnnnn 

nnnnnnnnnn 

nnnnnnnnnn 

nnnnnnnnnn 

nnnnnnnnnn 

nnnnnnnnnn 

nnnnnnnnnn 

nnnnnnnnnn 

nnnnnnnnnn 

nnnnnnnnnn 

nnnnnnnnnn 

nnnnnnnnnn 

nnnnnnnnnn 

nnnnnnnnnn 

tcgccggcaa 

acccgttgcg 

gcgtacggta 

cgtccagtca 

ggtcagtaat 

cgtcagcctg 

aacctcgctg 

atcatattag 

ggaatgttgc 

caattagtta 

gtaccagttt 

agtcgctgcc 

caatcagcgg 

gtatcggttg 

tgaatgatcg 

gttgatgccc 

cgcatgaacg 

ttaacggcgc 

ttagtgcgta 

ttggcgttga 

acaataat ca 

ggtttaatgg 



ttcctattct 

gaaattaata 

tcannnnnnn 

nnnnnnnnnn 

nnnnnnnnnn 

nnnnnnnnnn 

nnnnnnnnnn 

nnnnnnnnnn 

nnnnnnnnnn 

nnnnnnnnnn 

nnnnnnnnnn 

nnnnnnnnnn 

nnnnnnnnnn 

nnnnnnnnnn 

nnnnnnnnnn 

nnnnnnnnnn 

nnnnnnnnnn 

nnnnnnnnnn 

nnnnnnnnnn 

tgccaacggg 

tcgcgtttgg 

ataaataaca 

ggacagaaac 

gcggatgcga 

gacacctcgc 

ttccgtctgt 

ttacaaggca 

gtatcaggct 

cagctaccaa 

gcggtataca 

ggcccgccca 

tttgccggtg 

actgatcggt 

attttactat 

cttcgctttt 

ttacccaggc 

cccact cggt 

atgactcacc 

ttggatcgcg 

ggttatccgg 

tattgctcca 



ctagaaagta 

cgactcacta 

nnnnnnnnnn 

nnnnnnnnnn 

nnnnnnnnnn 

nnnnnnnnnn 

nnnnnnnnnn 

nnnnnnnnnn 

nnnnnnnnnn 

nnnnnnnnnn 

nnnnnnnnnn 

nnnnnnnnnn 

nnnnnnnnnn 

nnnnnnnnnn 

nnnnnnnnnn 

nnnnnnnnnn 

nnnnnnnnnn 

nnnnnnnnnn 

nnnnnnnnnn 

ttcaaactgg 

caacgtgagc 

cgtcatataa 

attacttccg 

gtcaactcat 

tggctaccgt 

cggctaactc 

gcgcaccgga 

gcaatcgcag 

cctgtgccca 

ggttcccttt 

gt ctttgcta 

tgtcggttgt 

tgttgtactg 

tttacccgac 

atcatttaac 

gtcggtatct 

gacgaaaagc 

atgggtt ccc 

cgacgctt ca 

gtcaatggcg 

tgaaacctga 



taggaacttc 

tagggagacc 

nnnnnnnnnn 

nnnnnnnnnn 

nnnnnnnnnn 

nnnnnnnnnn 

nnnnnnnnnn 

nnnnnnnnnn 

nnnnnnnnnn 

nnnnnnnnnn 

nnnnnnnnnn 

nnnnnnnnnn 

nnnnnnnnnn 

nnnnnnnnnn 

nnnnnnnnnn 

nnnnnnnnnn 

nnnnnnnnnn 

nnnnnnnnnn 

nnnnnnnnnn 

gaggaaccag 

aaaggttttg 

aaacggcatt 

caacaacgtg 

ggatacggac 

at ctcgtgat 

aaagctgatt 

tttgggcgct 

ccctgttatt 

ggaggaatcg 

gtagacgatc 

acccagttgg 

ggtcggttcg 

gcggcgctgc 

tcggtcaggt 

gcccagtttg 

gt ctggttca 

gcaataccgt 

gcgtagaaat 

tcaacgtttt 

cgaatggcgg 

agcggctcgt 



gaattgtcga 60 
ggaattcccc 120 
nnnnnnnnnn 180 
nnnnnnnnnn 240 
nnnnnnnnnn 300 
nnnnnnnnnn 3 60 
nnnnnnnnnn 4 20 
nnnnnnnnnn 4 80 
nnnnnnnnnn 54 0 
nnnnnnnnnn 600 
nnnnnnnnnn 6 60 
nnnnnnnnnn 7 20 
nnnnnnnnnn 7 80 
nnnnnnnnnn 84 0 
nnnnnnnnnn 900 
nnnnnnnnnn 9 60 
nnnnnnnnnn 102 0 
nnnnnnnnnn 108 0 
nnnnnnnnnn 114 0 
cggtcggcaa 1200 
accagaacaa 1260 
aattatgctt 1320 
tctctgtctg 1380 
agcggcttcc 1440 
aatgacggca 1500 
aacgcgggta 1560 
tttgaacgca 1620 
ttgggggtga 1680 
ctgcccggaa 1740 
gattggcctg 1800 
ggtaaacgtt 1860
tcaacggtgg 1920 
ccgctttata 1980 
ttttagagtc 2040 
cgttgctgat 2100 
ctccgccatt 2160 
tatttaacgc 2220 
gcagcgtata 2280 
gcgaccaact 2340
aaatcacggc 2400 
tgtagatttc 2460
ataaatgaca ttcggcttgt tgccatattt gcgcgccatt tcctggaaga agcgaatggc 2520
ttcactgcga ttgttttctg cagaatgtga gtgccagtca ataatcacat acatatcgtt 2580
ggcgattgcg gcatccacca ctctttcaac tttggccttg ttgccagccg ggtcctgcag 2640
ataaccaccg ctttcctgaa cgcccatagc ggcgcgaaca atgctggatt tccagtcttt 2700
tttcagcgac gcaacggtat cggctgtgta gaatttttcc ccaccccaac cattattact 2760
ccagaataag ctgttgccgg caaaactttt ggctttttca cctgcgtaga ttttattgcc 2820
gctaacggat aacggctcaa cactcgccca ggcattactg gaaaggcagg caaggcttaa 2880
tcctaaacag gcacaagata gaaacagttt tttacgtggg gcgtgttttt ggctatcgat 2940
gactggatgg ttatccgaat aagagagagg cataaatgaa tctccatttc atagactcta 3000
gaggatcacg aaaatcaggc agcgcttaaa tctgccaaaa caccggaggc tgcttcatta 3060
tggcagcgca aggcaatttc agtataaagg gcatcacgat cgaaatcaat gctgccgccc 3120
tgaatggctt ccatcaggtc ttctgcgccg acaacttctg caccagcagc tttagcttct 3180
tcagctttcg ggccacgggc gaaaacaccg acgcgaacat ttttaccggt gcccttcggc 3240
aaggtaacaa caccacgaac catctgatca aggtattaag gccgccggac aggtaaaatg 3300
gttatggcta accgatgccg atattgcgca tcaggatgat accttgaaaa tgttggttgc 3360
tcatgccgaa aaagacggat tgatttttca ttccctgatg gcgcgactaa gttgtaacaa 3420
ttttgcagaa tgggcgctta ttccggcctt cgtctttttc ttccagatgt tgtatccttt 3480
tcggcatatc aataatccca agcataaaac ggccggtgct gccggtggat gcatgttagc 3540
cgatcatcat ctgtgctggt ttggataaac cgttactaat ctggcgcgcg acccgataga 3600
cagcggcttc cccagccccc agtgagctac caatagcaag ggtgccaccc tgctggaaaa 3660
tggagtctag gatttcatta aaactgacgc ttaaagtgaa tttccacatc ccttcaacgg 3720
gcgtttttct ctgaaatatt tttcttatcg gaaatgctcg ctccgtatat tggtgaaaga 3780
gataaatgcc agcgtaacta caggtgacaa aaagcgtgaa ttgcgtcagg caccatataa 3840
acaaaaaata ccccaaaggc atatgaagcc aataaccaat gccacaacct ccggttctga 3900
cgcaggtcgt aataaactca taaatagtga ccagtttaaa gcggttacac agccgcaaca 3960
tcccggttgg accagccgat attcataaaa agtataatca gcatacaaag caaggcatct 4020
ggcttgacct cggcaggcca tcccaatctt gaagtgccta aaatcaagat accgcctaac 4080
cctaccaaca tacctatcgc cgcactaaaa aaatcggctc tgatgcaaaa ggcaaggaca 4140
tcatcaaatt ggttaaaatc tttttcctga aaagcttttg aaccgtaatg cagcaaggtt 4200
tgccatgact gaaaacgggt aatatccgaa atcagggtag caaaagtcgt aatcaaaagc 4260
atgacaccaa acaggttcaa tcccaatgtc tgggatgtcc atgcgacata gacaaaactg 4320
cataccgcat tcaaaacacg gcctgtaatc agaatgccgg tattgccaat gatacgggaa 4380
aagacagatc gaatttgatg ccgtggataa attgccggtt gatcgatccc cgggtaccga 4440
gctcgaattc cgagcttggc gcgcctatgc ggtgtgaaat accgcacaga tgcgtaagga 4500
gaaaataccg catcaggcgc cattcgccat tcaggctgcg caactgttgg gaagggcgat 4560
cggtgcgggc ctcttcgcta ttacgccagc tggcgaaagg gggatgtgct gcaaggcgat 4620
taagttgggt aacgccaggg ttttcccagt cacgacgttg taaaacgacg gccagtgaat 4680
tccatcaacg cttgctgtaa ccaggagcca aagctatgaa tgtacctttt agctactcgt 4740
cacccaccct gagcgttgag gcgttaaagc actctattgc ttataagctg atgtttatca 4800
tcggcaaaga cccggctatc gctaacaagc atgaatggct caacgccacg ctgttcgccg 4860
ttcgcgatcg tatggttgag cgctggctgc gctcaaaccg cgcgcacgtc tctcaggaag 4920
ttcgccaggt ttactacctg tcgatggaat ttttgattgg ccgtacgttg tccaacgcgc 4980
tgttatcgct cggcatttat gaggatgtga acagcgcgct ggaagagatg gggctgaacc 5040
ttgaagaatt aattgatgaa gaaaacgacc cgggcttagg caacggcggt cttggtcgtc 5100
tggcggcctg cttcctcgat tcgcttgcgg cgctggggtt accgggccgc ggctacggta 5160
ttcgctacga ctacgggatg tttaagcaga atatcgtcga tgggcggcag aaagaatccc 5220
cggattactg gctggaatac ggtaacccgt gggagttcga gcgccataat acgcgctaca 5280
aagtgcgctt cggcggacgc attcagcagg aaggtaaata ctcccgctgg gtggagaccg 5340
aagagattat tgccgaagcc tatgaccaga ttatccctgg cttcgacacc gacgccacca 5400
acacgctgcg cctgtggagc gcccaggcca gcagcgagat taacctcggt aaattcaacc 5460
agggcgacta cttcgcggcg gtggaagata aaaaccattc cgagaacgtg tcgcgggtac 5520
tctatccgga tgactcgacc tattcaggac gcgagctgcg cctgcggcag gagtacttcc 5580
tcgtttcggc gacggtgcag gacatcctca gccgccacta ccagctgcat aaaacctacg 5640
ccaacctggc ggacaaaatc gcgattcatc tcaacgacac gaacccggtg ctgtcgattc 5700
cggagctgat gcgcctgctg attgacgagc ataagatcag ctgggatgag ggnnnnnnnn 5760
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 5820
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 5880
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 5940
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 6000
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 6060
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 6120
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 6180
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 6240
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 6300
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 6360
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 6420
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 6480
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 6540
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 6600
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 6660
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 6720
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 6780
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 6840
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 6900
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 6960
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 7020
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 7080
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 7140
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 7200
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 7260
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 7320
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 7380
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 7440
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 7500
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 7560
nnnnnnngaa ttcgagctcg gtacccgggg atcgacataa ccgataggtc ctgcattgat 7620
ggactgaaag gtttcgacgg cggttgtgtg ggttttgctt gcccatctgg cggcatgaat 7680
agtgtcattc atgacgatcc agttcgatat tcaacagacc gtctttgtaa tcggcaccga 7740
caattttgat caataaagcg tttgacctga tgcatgaggg taaatccatt cgttcggttg 7800
ttcttttctg attacctgtc ctgttaacct gtggatatag aaggtcggtt caatgagtag 7860
tattctgacg catctgacaa ttggttccaa tgacctgaag aaggcgcgca tcttttatga 7920
tgctgttttg gaaccgttgg gtatcaaact tattcgcgag gtcgaaggac agcgttttgc 7980
ctatggtaaa gacggcgaag aaggacgcat catcattgta aagcctatta atggtgaagc 8040
cgctaccgct ggaaatggta tcactatcgg tttggcagcg ccttctgatg aagctgtcga 8100
tgctttttat aaagcaggct tggctaatgg cggtaaggat gccggagaac cggggcctcg 8160
tccggctgct aataattctc ggggtgccta tttatatgac cctgaaggca ataaaatctg 8220
cgctttcaat tttaaataag atttctttgg tgcagggtta ttcaaaatag ccctgcattt 8280
tcagtattat agcggccatt atggcttttg ccttgataaa aaatttatca gggctgtttt 8340
tcgtgatgaa tatttttgat ttttcaagaa aagcctgata tcttccaaca tctttctttg 8400
tatataaatg gagcgagcta tggcgcgcgt aactgtcgaa gactgtatcg ataaagttca 8460
taatcgtttc gatttgatcc tctagagtca acctgcttgt tactcgtgat cccattcaca 8520
agggcgaatt aattcgccct tctgttccgt taccaacact tgagccggag gcataatggg 8580
aaagccaatg tggcgttgtt gggcgttgat gctgatggtg tggttcagtg cgtcggctac 8640
ggcggcgaac ggctgggaaa tctataaaag ccgtttcatg accacggacg ggcgcattca 8700
ggataccggc aataagaatg tcagccacac cgaaggtcag ggattcgcca tgctgatggc 8760
ggtgcattac gatgaccgca tcgcgttcga taacctgtgg aactggacgc aaagccacct 8820
gcggaacacg accagcggct tgttctactg gcgttacgat ccgtcggcgg ccaatccggt 8880
ggtggataag aacaacgcct cggatggcga tgtgctgatt gcctgggcgt tgttaaaagc 8940
gggaaataag tggcaggaca accgttacct gcaggcgtcg gacagcatcc agaaagcgat 9000
catcgccagc aatatcattc agtttgcggg ccgcaccgtg atgttgcccg gcgcctatgg 9060
tttcaacaag aacagctatg tgatccttaa cccgtcgtat ttcctgttcc cggcctggcg 9120
cgactttgct aaccgcagcc atcttcaggt gtggcggcaa ctgattgacg acagcctgtc 9180
attggtcgga gaaatgcgtt tcggtcaggt cgggctgccg acggactggg cggcgctgaa 9240
cgcggatggc tcgatggcgc cggcgacggc ctggccgtcg cgtttcagtt acgacgccat 9300
tcgtatcccg ctgtatttgt actggtatga cgccaaaacc acggcgctgg tgccgttcca 9360
gctgtactgg cgtaactatc cccgcctgac gacgccggcc tgggttgatg tgctgagcag 9420
taacaccgcg acttacaata tgcagggcgg tttgctggcg gtgcgcgacc tgacgatggg 9480
caacctcgac gggctcagcg atctgccagg cgcatcggaa gattactact cgtcgagcct 9540
gcgcctgctg gtgatgttgg cgcgcggtaa ataaccttat tcttgcggta cacatggcga 9600
ggacgatgtc cttgccattt tccccacttt tatccctctg aatggcgtgt ttttcacgct 9660
ttgttaacct gcttgttact cgtgatccca ttcacaaggg cgaattgacc tgcaggcatg 9720
caagcttggc gtaatcatgg tcatagctgt ttcctgtgtg aaattgttat ccgctcacaa 9780
ttccacacaa catacgagcc ggaagcataa agtgtaaagc ctggggtgcc taatgagtga 9840
gctaactcac attaattgcg ttgcgctcac tgcccgcttt ccagtcggga aacctgtcgt 9900 
gccagctgca ttaatgaatc ggccaacgcg cggggagagg cggtttgcgt attgggcgct 9960 
cttccgctgg cgcgccaggt cgactctaga ggatccccgg ggaagatctt ccggaagatc 10020 
ttcccgagct cgaattaatt ccgcgatgaa ttgatcccgg aagttcctat tctctagaaa 10080 
gtataggaac tcgaattggt cgacaagcta gcttgcatgc aagcttgtat tctatagtgt 10140 
cacctaaatc gtatgtgtat gatacataag gttatgtatt aattgtagcc gcgttctaac 10200 
gacaatatgt acaagcctaa ttgtgtagca tctggcttac tgaagcagac cctatcatct 10260 
ctctcgtaaa ctgccgtcag agtcggtttg gttggacgaa ccttctgagt ttctggtaac 10320 
gccgttccgc accccggaaa tggtcagcga accaatcagc agggtcatcg ctagcccatg 10380 
gctaattctg tcagccgtta agtgttcctg tgtcactgaa aattgctttg agaggctcta 10440 
agggcttctc agtgcgttac atccctggct tgttgtccac aaccgttaaa ccttaaaagc 10500 
tttaaaagcc ttatatattc ttttttttct tataaaactt aaaaccttag aggctattta 10560 
agttgctgat ttatattaat tttattgttc aaacatgaga gcttagtacg tgaaacatga 10620 
gagcttagta cgttagccat gagagcttag tacgttagcc atgagggttt agttcgttaa 10680 
acatgagagc ttagtacgtt aaacatgaga gcttagtacg tgaaacatga gagcttagta 10740 
cgtactatca acaggttgaa ctgcggatct tgcggccgca aaaattaaaa atgaagtttt 10800 
gacggtatcg aaccccagag tcccgctcag aagaactcgt caagaaggcg atagaaggcg 10860 
atgcgctgcg aatcgggagc ggcgataccg taaagcacga ggaagcggtc agcccattcg 10920 
ccgccaagct cttcagcaat atcacgggta gccaacgcta tgtcctgata gcggtccgcc 10980 
acacccagcc ggccacagtc gatgaatcca gaaaagcggc cattttccac catgatattc 11040 
ggcaagcagg catcgccatg ggtcacgacg agatcctcgc cgtcgggcat ccgcgccttg 11100 
agcctggcga acagttcggc tggcgcgagc ccctgatgct cttcgtccag atcatcctga 11160 
tcgacaagac cggcttccat ccgagtacgt gctcgctcga tgcgatgttt cgcttggtgg 11220 
tcgaatgggc aggtagccgg atcaagcgta tgcagccgcc gcattgcatc agccatgatg 11280 
gatactttct cggcaggagc aaggtgagat gacaggagat cctgccccgg cacttcgccc 11340 
aatagcagcc agtcccttcc cgcttcagtg acaacgtcga gcacagctgc gcaaggaacg 11400 
cccgtcgtgg ccagccacga tagccgcgct gcctcgtctt ggagttcatt cagggcaccg 11460
gacaggtcgg tcttgacaaa aagaaccggg cgcccctgcg ctgacagccg gaacacggcg 11520 
gcatcagagc agccgattgt ctgttgtgcc cagtcatagc cgaatagcct ctccacccaa 11580 
gcggccggag aacctgcgtg caatccatct tgttcaatca tgcgaaacga tcctcatcct 11640 
gtctcttgat ccactagatt attgaagcat ttatcagggt tattgtctca tgagcggata 11700 
catatttgaa tgtatttaga aaaataaaca aataggggtt ccgcgcacat ttccccgaaa 11760 
agtgccacct gc 11772 



